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Abstract: The precise estimation of the satellite clock bias (SCB) holds considerable 
importance in ensuring accurate timekeeping, navigation, and positioning. This study 
introduces a novel SCB prediction approach that integrates variational mode decomposition 
(VMD) and long short-term memory (LSTM) network techniques, combining signal 
decomposition with deep learning methodologies. Initially, the raw SCB data undergoes 
preprocessing, followed by decomposition using the VMD method to generate multiple 
intrinsic mode functions (IMFs). These decomposed IMFs serve as inputs for LSTM, where 
several independent LSTM models are established for training and prediction purposes. 
Subsequently, the predicted outcomes are aggregated and reconstructed to derive the final 
SCB prediction. Experimental findings demonstrate notable advancements in clock bias 
prediction for the spaceborne hydrogen atomic clock for BDS, with prediction accuracies of 
0.048 ns, 0.204 ns and 1.397 ns for 6 hours, 3 days and 15 days, respectively. These results 
exhibit significant enhancements compared to both the LSTM network and the Back 
Propagation (BP) neural network, with improvements of 56%, 84% and 83% for the 
aforementioned time intervals in comparison to LSTM, and enhancements of 59%, 82% and 
83% relative to the BP neural network. 
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1 Introduction 


The atomic clock serves as a highly precise timekeeping device and is a fundamental 
component of satellite navigation systems. Its stability and accuracy play a critical role in 
ensuring precise time synchronization across various fields such as modern science, 
navigation systems, and communication measurements. However, the atomic clock's stability 
may be affected by external factors over time. The real-time clock bias data provided by the 
International GNSS Service (IGS) may not be sufficient for achieving real-time precision in 
single-point positioning of satellite navigation systems. Additionally, the subsequent release 
of the final precision clock bias data is typically delayed by 13 days (IGS, 2024). Therefore, 
the accurate forecasting of atomic clock bias is essential for achieving precise navigation and 
positioning, as well as for maintaining accurate timekeeping, time synchronization, and 


frequency control. 

In recent decades, numerous researchers have developed various models for predicting 
the satellite clock bias (SCB), broadly categorized into two groups: frequency domain 
analysis methods, such as spectral analysis (SA) (Zheng et al. 2010; Zhao et al. 2021), which 
decomposes time series data into different frequency components to analyze and predict their 
periodicity and frequency characteristics. However, the SA method necessitates a long-term 
and stable clock bias sequence, which is not feasible for the long-term forecasting of SCB due 
to its non-stationary long-term change (Heo et al. 2010). The other category comprises time 
domain analysis methods, including quadratic polynomial (QP) (Huang et al. 2011; Huang et 
al. 2018), GM(1,1) (Lu et al. 2008; Liang et al. 2016), autoregressive integrated moving 
average (ARIMA) (Xu et al. 2009; Xi et al. 2014; Zhang et al. 2022), Kalman Filtering model 
(Davis et al. 2012; Zhu et al. 2008), exponential smoothing method (Wang et al. 2017; Yu et 
al. 2020), and machine learning methods (Lu et al. 2023). QP, based on atomic clocks' 
physical properties, offers a clear physical interpretation but exhibits increasing prediction 
errors with longer forecast horizons and lacks the ability to capture clock bias' nonlinear 
dynamic characteristics (Jonsson and Eklundh 2002; Cui and Jiao 2005; Lei et al. 2022). The 
GM(1,1) model requires minimal sample data, simple modeling, and demonstrates good 
long-term forecasting performance, yet it necessitates a smooth and exponential change 
pattern in clock bias series, limiting its applicability (Lu et al. 2008). ARIMA can identify 
trends, seasonality, and cyclicity in time series data for future predictions but requires smooth 
data, making it unsuitable for long-term forecasting (Lei et al. 2022). The Kalman Filtering 
model predicts clock bias by recursively updating the state and calculating filtering results 
using system state and observation models. While effective in handling noise and uncertainty, 
it struggles with nonlinear problems and long-term dependencies, requiring accurate 
determination of process and observation noise to prevent errors in state estimation and 
reduce prediction accuracy (Lei et al. 2022). Exponential smoothing is typically suitable for 
short- and medium-term forecasts but may underperform with data exhibiting long trends or 
complex nonlinear relationships (Lei et al. 2022). Machine learning methods for clock bias 
prediction encompass support vector machine (SVM) (He et al. 2019), BP neural network 
(Bai et al. 2023), wavelet neural network (WNN) (Ai et al. 2016; Wang et al. 2017; Wang et 
al. 2020; Wang et al. 2021), radial basis function (RBF) (Wang et al. 2014), LSTM (Huang et 
al. 2021; He et al. 2023; Cai et al. 2024), and combination models like ARMA model and 
RBF neural network (Li et al. 2013), and wavelet transform and SVM (Lei et al. 2014). 
Neural networks, with rapid learning and superior generalization capabilities, are ideal for 
nonlinear time series forecasting, with ongoing research focusing on optimizing parameters to 
enhance forecasting accuracy and stability. However, challenges such as local optimization, 
overfitting, and underfitting persist, necessitating further exploration in data preprocessing, 
model parameter selection, and training strategies. 

The SCB is characterized as a non-smooth and non-linear time series. Due to potential 
issues in the original SCB data, such as data loss, clock jumps, and gross errors (Feng 2009), 
utilizing the raw aberrant data quality may compromise the development and effectiveness of 
prediction models. Consequently, it is imperative to preprocess the initial SCB data 
post-acquisition to ensure the prediction model's reliability. Preprocessing techniques for the 
original SCB data typically involve methods such as the Median Absolute Deviation (MAD) 


(Feng 2009), the 30 criterion, and the enhanced MAD method (Wang et al. 2016; Huang et al. 
2022), etc.. This study employs the MAD method for data preprocessing. 

Previous research on predicting SCB has primarily focused on time domain analysis, 
neglecting the essential frequency domain information of the time series. To enhance 
prediction accuracy and stability by leveraging the frequency characteristics of SCB data, this 
study proposes utilizing Variational Mode Decomposition (VMD) (Dragomiretskiy and Zosso 
2014) to decompose the SCB time series into multiple mode components. VMD is a signal 
decomposition technique suitable for data with non-linear and non-smooth attributes like SCB. 
By decomposing the data into distinct frequency components and trend information, VMD 
enables a better understanding of underlying patterns in the data, such as long-term trends, 
seasonal variations, and cyclical fluctuations. Notably, VMD can effectively isolate noise or 
disturbances as low-frequency mode components, facilitating clearer observation and analysis 
of true trends and cyclical changes in the data, thereby enhancing prediction accuracy. 

Long Short-Term Memory (LSTM) represents a specialized form of recurrent neural 
network (RNN) distinguished by its superior memory capabilities and capacity to model 
extended dependencies (Yu et al. 2019). This is attributed to its distinctive architecture, which 
addresses the limitations of conventional RNNs in capturing lengthy sequences and 
effectively managing issues like gradient vanishing or explosion. Notably, LSTM excels in 
encoding sequential data, rendering it well-suited for addressing time series tasks such as 
SCB prediction. Despite the widespread application of LSTM in various domains for time 
series forecasting, the existing literature on SCB prediction utilizing LSTM remains relatively 
scarce (Huang et al. 2021; He et al. 2023; Cai et al. 2024), indicating a requirement for further 
investigation. 

This study represents the inaugural endeavor to integrate VMD with LSTM for SCB 
prediction. Initially, the original SCB data undergoes phase-frequency conversion, followed 
by preprocessing the SCB frequency data using the MAD method. Subsequently, the 
preprocessed data is decomposed utilizing VMD, with the decomposed data components fed 
into the LSTM network. The predicted values of each component are then aggregated and 
reconstructed to derive the final SCB prediction. The efficacy and benefits of the 
VMD-LSTM model in the realm of SCB prediction are validated through experimental 
analysis. The results of the experiments demonstrate notable enhancements in SCB prediction 
accuracy when applied to the spaceborne hydrogen atomic clock for BeiDou Navigation 
Satellite System (BDS). Specifically, the model's clock bias prediction results for 6 hours, 3 
days, and 15 days are 0.048 ns, 0.204 ns and 1.397 ns, respectively, showcasing superior 
performance compared to both the standalone LSTM network and BP neural network. The 
model's predictions exhibit a 56%, 84% and 83% improvement for 6 hours, 3 days, and 15 
days, respectively, over the LSTM network, and a 59%, 82% and 83% enhancement over the 
BP neural network. By enhancing the precision and reliability of SCB prediction, this 
research aims to establish a robust foundation for applications such as precise navigation, 
positioning, timekeeping, time synchronization, and frequency control. 


2 Variational Mode Decomposition 


The technique known as VMD was introduced by K. Dragomiretskiy et al. (2014) as a 
signal processing method. VMD is capable of breaking down a multi-component signal into 


individual amplitude modulation (AM) and frequency modulation (FM) signals 
simultaneously. Through an iterative process of optimizing a variational model, VMD 
identifies the frequency center and bandwidth of each signal component. This enables the 
adaptive dissection of the signal in the frequency domain and the effective isolation of each 
component. Specifically, if the original signal comprises K sub-signals with distinct center 
frequencies and finite bandwidths (referred to as Intrinsic Mode Functions, IMFs), VMD 
utilizes the Alternating Direction Multiplier Method (ADMM) and the Parseval theorem to 
iteratively update the center frequency and bandwidth of each mode. Subsequently, the modes 
are demodulated to their respective fundamental bands, resulting in the extraction of mode 
components with varying center frequencies. 

The essence of the VMD technique lies in formulating a variational problem and 
subsequently solving it. 

(a) Intrinsic Mode Function 

The VMD method redefines IMF within a limited frequency range by imposing stricter 
limitations, as formally represented through mathematical expressions: 

u,(t) = A, (t) cos(¢, (£) (2-1) 

In the given equation, 4,(t) represents the envelope amplitude of signal u,(t) , while 
¢,(t) denotes the instantaneous phase. 

(b) Development of Variational Problem 

The variational problem is subject to two primary constraints: firstly, the total bandwidth 
of the central frequencies of each modal component must be minimized; and secondly, the 
sum of all modal components must be equivalent to the original signal. 
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where u, = fuys. U ma represents the collection of mode functions, 


QO, = {0,,0,5...,0,} denotes the ensemble of center frequencies, O, signifies the partial 


t 
derivative of the function in relation to the variable time t, O(t) stands for the Dirac 
function, and the symbol * denotes the convolution operator. 

To minimize the bandwidth between individual modes derived from VMD and ensure 
their sum equals the original signal f, a series of steps are undertaken. Firstly, the Hilbert 
transform is applied to each mode function u,(t) to obtain the unilateral spectrum 


[om+ Zou, (t) . Subsequently, this spectrum is adjusted by multiplying it with the 


estimated center frequency e/® to align each mode's spectrum with the estimated center 
frequency. Finally, the bandwidth of each mode's signal is determined by employing Gaussian 
smoothing, specifically by calculating the square root of the gradient of the L, paradigm. 

(c) Addressing of sVariational Problem 

To determine the optimal solution for the variational constraint model described above, 
the Lagrange multiplier A and the quadratic penalty factor @ are incorporated to convert 
the inequality constraints into equational constraints. This transformation yields an augmented 
Lagrange expression: 
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where Œ serves to mitigate the effects of Gaussian noise, while A ensures the robustness of 


the constraint issue. 


Fig.l VMD Flowchart 
As illustrated 1 a L il VMD technique is executed through the subsequent procedures. 


(i) Initialize Wij}, fol}, P}, nO; 


(ii) Set the number of iterations n=n+1; 


(iii) When @>0, update the spectrum ù, of each mode accordingly: 
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(iv) Update the Lagrange multiplier 1: 
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Among which, £ is the noise tolerance parameter. For signals with strong noise, setting € 
to 0 can achieve better noise reduction effects. 
(v) Continue to perform Steps (ii) through (iv) repeatedly until the specified iterative 


constraints are met: 
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The variable © represents the precision of the discriminant, and o >Q. 


3 Long Short-Term Memory Network 


The Long Short-Term Memory (LSTM) network is an enhanced recurrent neural network 
(RNN) algorithm introduced by S. Hochreiter and J. Schmidhuber (1997). It incorporates 
specialized gating mechanisms to regulate the information flow and memory updating 
processes. These gating units, such as the input gate, forget gate, and output gate, utilize a 
combination of sigmoid activation functions and dot-multiplication operations to selectively 
retain or discard information. This selective retention enables the LSTM network to focus on 
pertinent information for making predictions, mirroring the cognitive processes of the brain in 
extracting and articulating key concepts. 

As illustrated in Fig.2, the symbols i, f, and o are utilized to represent the input gate, 
forget gate, and output gate, correspondingly. The symbol ® signifies the element-wise 
multiplication, while © denotes the element-wise addition. The weight matrix and bias 
vector of the network are represented by W and b, respectively. At time ¢, the input and output 
vectors of the LSTM hidden layer are denoted as x; and As, respectively. The memory cell is 


denoted as C;, with C.1 representing the preceding memory cell state. The outputs of the 
forget gate and input gate are represented by f; and is respectively, while the candidate 


information is denoted as C, . Specifically, o represents the sigmoid function, and tanh 


signifies the hyperbolic tangent function. 
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Fig.2 LSTM cell and Network Architectures 


(a) Input Gate 


The input gate is comprised of a sigmoid activation function and a hyperbolic tangent 
tanh activation function. The sigmoid function produces an output ranging from 0 to 1, 
determining the significance of information. On the other hand, the tanh activation function 
generates new candidate information denoted as Č within the range of -1 to 1. The product 
of the output i; from the input gate and the candidate information C, is calculated, and this 
result is utilized in the updating process of the memory cell. 

i, = oW, [h sX; ] a b,) 
C, = tanh(W,[h,_,.x, |+ be) 

(b) Forget Gate 

The forget gate is comprised solely of a sigmoid activation function that generates a 
value ranging from 0 to 1, determining the retention or discarding of memories. Additionally, 
it effectively mitigates the issues of vanishing and exploding gradients encountered during 
backward gradient propagation over time. The sigmoid function's output is directly applied to 
the previous memory cell state, preserving information with outputs closer to 1 and discarding 
information with outputs closer to 0. 

f= o(W,[h,,.x,]+b,) 
(c) Output Gate 


The output gate plays a crucial role in determining the specific information from the 
memory cell that is to be transmitted to the hidden state at the present moment, thereby 
selecting the segment of the memory cell to be conveyed at time ¢. This gate comprises both a 
sigmoid activation function and a tanh activation function. The sigmoid function is 
responsible for determining the pertinent information for output, while the tanh function is 
involved in processing the memory cell's state in readiness for output. The output from the 
sigmoid function is then multiplied by the memory cell's state that has been processed by the 
tanh function, resulting in the generation of the hidden state at the current time. 


C, = f, ®C,,+i,8C, 
o, =0(W,[h,..x,]+,) 
h, =o, ® tanh(C, ) 

The LSTM network is comprised of one or more LSTM cells, with a deep network being 
formed by stacking multiple LSTM cells. Initially, the output of each neuron is computed in a 
forward manner, followed by the calculation of the error term for each neuron through 
back-propagation of errors. Subsequently, the weight matrix and parameters are adjusted 
based on the principles of gradient descent. By iteratively updating the weight matrix and 
parameters within the LSTM using sample data, the network's overall error is minimized until 
it meets a predefined threshold or the maximum number of training iterations is reached, 
signifying the completion of LSTM training. The trained LSTM model can then be utilized 


for data prediction. 


4 SCB Prediction Model Construction based on VMD-LSTM 


As shown in Fig.3, the VMD-LSTM prediction model outlined in this study primarily 
comprises the subsequent procedures: 
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Fig.3 SCB Prediction Model based on VMD-LSTM 

(a) Data Preprocessing 

Initially, the time series data acquired undergo preprocessing to address potential issues 
within the original SCB dataset, such as missing data, clock jumps, and gross errors. Utilizing 
SCB data with quality concerns directly may compromise the development and effectiveness 
of the SCB prediction model. Therefore, it is imperative to preprocess the raw SCB data to 
ensure the reliability of the prediction model. Furthermore, the original SCB data is in phase 
form, typically featuring large values (typically ranging from E-4 to E-5), making it 
challenging to identify outliers directly. Consequently, the SCB phase data is transformed into 
frequency data, and any outliers within the SCB frequency data are addressed. This study 
employs the MAD method for outlier processing within the data. 

(b) Data Partitioning 

Following the removal of outliers, the data is partitioned into three distinct sets: the 
training set, validation set, and test set, as described in Fig.4. The training set is employed for 
model training purposes, the validation set aids in fine-tuning hyperparameters and model 
selection, while the test set is utilized to assess the overall performance and generalization 


capabilities of the final model. 
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Fig.4 Training Set, Validation Set and Test Set 


(c) Series Decomposition 

The time series data is analyzed through the VMD technique to extract multiple intrinsic 
mode components IMF; and a residual term. VMD requires the specification of parameters 
such as the mode number K, penalty factor @ , and convergence stopping condition © . The 
challenge and critical aspect of the process involve the selection of appropriate values for K 
and @,while © is typically set to a default value. Opting for a higher mode number may 
result in mode repetition or additional noise, whereas a lower number could lead to 
underdecomposition of the modes. The parameter @ primarily influences the bandwidth of 
the IMFi, with larger values narrowing the bandwidth of each IMF; component, and 
conversely, smaller values widening the bandwidth of the component. 

In this study, the K value is ascertained through the observation center frequency 
technique. The predetermined mode number K is systematically varied from small to large, 
and the optimal value of K is considered to be reached when the center frequency of the final 
layer of the IMF; component exhibits a consistent stability. 

(d) Data Normalization 

Data normalization involves the process of standardizing the data by mapping the input 
and output features of a sample dataset to a range of [-1,1]. This procedure aims to mitigate 
the impact of varying magnitudes within the data, thereby enhancing the convergence rate of 
the model. 

(e) Prediction Using LSTM 

Distinct LSTM network models are created for individual mode components IMF; to 
forecast outcomes. The optimization of hyperparameters plays a crucial role in enhancing the 
predictive precision of the LSTM network. The hyperparameters considered in the model 
outlined in this study encompass the quantity of layers, the dimensions of the hidden layer, 
the batch size, the random dropout rate, the epoch, the loss function, the optimizer, the 
learning rate, the training step, the prediction step, and the number of training iterations, etc.. 
To mitigate the risk of error accumulation, single-step prediction methodology is adopted in 
this investigation. 

(f) Prediction Component Reconstruction 

The anticipated values of individual mode components are combined and aggregated, 
followed by inverse normalization and frequency-phase conversion processing to derive the 
ultimate prediction outcomes for the SCB. 


(g) Model Evaluation Metrics 


This study employs five metrics to assess the efficacy of the SCB prediction model 


utilizing VMD-LSTM. These metrics include Mean Absolute Error (MAE), Mean Square 
Error (MSE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), 
and R-Square (R-2). 
Suppose the predicted values y= TAS and the true values y= DoYo, r 
MAE signifies the average absolute error and provides insight into the accuracy of the 


predicted values, and the greater the error, the greater the value. 


MAE = Le 
N i=l 
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MSE represents the ratio of the sum of squares of the deviation from the predicted and 
actual values to the total number of samples. 
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RMSE is utilized to determine the square root of the sum of squared deviations between 


predicted and actual values relative to the total sample size, and the greater the error, the 
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MAPE can eliminate the impact of dimension and be used to evaluate bias objectively. 


greater the value. 
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Additionally, R2, a measure of goodness of fit, is employed to evaluate the degree to 
which the fitted values align with the observed data. Generally speaking, the larger the 
R-Square, the better the model fitting effect. 
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5 Case Analysis 


This study utilizes the 300-second precision SCB product of BDS provided by the 
International GNSS Monitoring and Assessment Service (IGMAS) for a duration of 106 days 
spanning from Sep. 6th to Dec. 20th, 2023 (denoted as 20230906-20231220). In order to 
evaluate the accuracy and universality of SCB prediction using VMD-LSTM model, this 
paper divides these 106-day data into five segments every 5 days, namely, 
20230906-20231130 (Segment 1), 20230911-20231205 (Segment 2), 20230916-20231210 
(Segment 3), 20230921-20231215 (Segment 4), and 20230926-20231220 (Segment 5). The 


training of the model involves using data from the initial 70 days as the training set and data 
from the subsequent day as the validation set. SCB predictions are conducted for varying 
durations of 6 hours, 3 days and 15 days. 

The spaceborne atomic clocks utilized by BDS in orbit predominantly consist of 
Rubidium (Rb) and Hydrogen (H) atomic clocks, with specific types of atomic clocks 


currently operational in orbit detailed in Table 1. 
Table 1 Spaceborne Atomic Clock Types of BDS 


Types PRN 
GEO 01(Rb) 02(Rb) 03(Rb) 04(Rb) 05(Rb) 59(H) 60(H) 
IGSO 06(Rb) 07(Rb) 08(Rb) 09(Rb) 10(Rb) 16(Rb) 38(H) 39(H) 40(H) 


11(Rb) 12(Rb) 13(Rb) 14(Rb) 19(Rb) 20(Rb) 21(Rb) 22(Rb) 23(Rb) 
MEO 24(Rb) 25(Rb) 26(H) 27(H) 28(H) 29(H) 30(H) 32(Rb) 33(Rb) 34(H) 
35(H) 36(Rb) 37(Rb) 41(Rb) 42(Rb) 43(H) 44(H) 45(H) 46(H) 

In this study, the VMD-LSTM model is utilized to investigate the prediction 
performance of clock bias in hydrogen clocks within BDS. As a result of the limited 
availability of GEO data pertaining to BDS provided by IGMAS during the specified 
timeframe, two satellites PRN27 (H) and PRN40 (H) in inclined geosynchronous orbit (IGSO) 
and medium Earth orbit (MEO) were randomly selected for experimental analysis. An 
illustration of PRN27 (Segment 1) is provided below for reference. 

5.1 Data Preprocessing of PRN27 
The initial phase data of PRN27 during the timeframe from Segment 1 is presented as 


Fig.5 below. 


x104 


0 Os 4 15 2 2.5 
Epoch(Smin) x104 
Fig.5 Atomic Clock Phase Data of PRN27 
As depicted in Fig.5, it is evident that the outliers are not readily discernible from the 
initial phase data. Consequently, a phase-frequency conversion process is conducted to derive 


the atomic clock frequency data, as illustrated in Fig.6. 
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Fig.6 Atomic Clock Frequency Data of PRN27 
In the atomic clock frequency data presented in Fig.6, certain data points that deviate 


significantly from the overall trend can be readily distinguished. In this study, these outliers 


are eliminated through the employment of MAD (set the threshold factor equal to 3), whereby 
the outliers are replaced through linear interpolation. Subsequently, Fig.7 displays the 
frequency data pertaining to PRN27 subsequent to the removal of outliers. 
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Fig.7 Atomic Clock Frequency Data of PRN27 after Preprocessing 


5.2 Decomposition Using VMD 
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Fig.8 Frequency Data Decomposition for PRN27 Using VMD 
Here, VMD is utilized to ascertain the quantity of sequence decompositions, with a total 
of three sequence decompositions selected for analysis. The decomposition outcomes of 
VMD applied to the atomic clock frequency data pertaining to PRN27 are illustrated in Fig.8. 


The figure displays the original waveform in the first column, while columns 2-4 represent 
the decompositions from high-frequency to low-frequency sequences. The final column 
exhibits the residual component of the decomposition process. 
5.3 Prediction Using LSTM 

Following the decomposition of the atomic clock frequency data through VMD, the data 
is subsequently normalized for each component and fed into the LSTM network model. The 
hyperparameters for the LSTM model utilized in this study were determined as outlined 


below: 
Table 2 Hyperparameter Selection for LSTM Network Model 


No Parameter Value 
1 Loss Function MSE 
2 Optimizer Adam 
3 Number of LSTM Layers [32,32] 
4 Number of Training Rounds 30150 
5 Input Dimension Size 10 
6 Output Dimension Size 1 
7 Batchsize 100 
8 Learning Rate 0.0002 
9 Epoch 150 


The frequency prediction performance of PRN27 for 15 days are presented in Fig.9 as 
follows: 
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Fig.9 Frequency Prediction Performance of PRN27 for 15 days 
5.4 SCB Prediction Results 
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Fig.10 The cleaned and predicted FDSCB of PRN27 for 15 days 
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Fig.11 The cleaned and predicted FDSCB of PRN40 for 15 days 


Upon the completion of the frequency prediction, the predicted frequency information is 
transformed into phase data (predicted), with the prediction outcome of SCB being 
determined by the RMSE between the cleaned and predicted phase data. The first difference 
of SCB (FDSCB) for PRN27 and PRN40 for 15 days in Segment 1 are shown in Fig.10 and 
Fig.11, where the the blue curve is the true value (cleaned) of FDSCB, and the red one is the 
corresponding predicted value. 

Table 4 illustrates the clock bias prediction outcomes for two BDS satellites, namely 


PRN27 and PRN4O, utilizing VMD-LSTM methodology for Segments 1 to 5. 
Table 4. SCB Prediction Results (RMSE) of PRN27 and PRN40 Using VMD-LSTM (ns) 


Time PRN27 (MEO) PRN40 (IGSO) 
6 hours 3 days 15 days 6 hours 3 days 15 days 
Segment 1 0.082 0.426 1.202 0.067 0.599 1.795 
Segment 2 0.026 0.113 0.629 0.047 0.194 0.477 
Segment 3 0.029 0.163 0.828 0.036 0.152 0.938 
Segment 4 0.048 0.069 0.645 0.100 0.231 1.458 
Segment 5 0.029 0.037 1.902 0.011 0.048 4.090 
Average 0.043 0.162 1.041 0.052 0.245 1.752 


(Segments1-5) 

5.5 Comparison and Analysis 
To evaluate the efficacy of SCB prediction utilizing the VMD-LSTM model, this study 
employs the LSTM model and the BP neural network model to forecast the identical dataset. 


The outcomes are presented in Table 5 and Table 6 as follows: 
Table 5. SCB Prediction Results (RMSE) of PRN27 and PRN40 Using LSTM (ns) 


Time PRN27 (MEO) PRN40 (IGSO) 
6 hours 3 days 15 days 6 hours 3 days 15 days 
Segment 1 0.350 2.553 8.875 0.192 1.845 9.620 
Segment 2 0.108 0.771 1597 0.050 1.287 7.468 
Segment 3 0.057 0.552 6.432 0.087 0.695 6.432 
Segment 4 0.062 0.828 7.799 0.065 1.500 8.429 
Segment 5 0.056 1.583 9.437 0.057 0.991 10.840 
Average 0.127 1.257 8.020 0.090 1.264 8.558 


(Segments1-5) 


Table 6. SCB Prediction Results (RMSE) of PRN27 and PRN40 Using BP (ns) 


Time PRN27 (MEO) PRN40 (IGSO) 
6 hours 3 days 15 days 6 hours 3 days 15 days 
Segment 1 0.255 2.366 9.557 0.221 1.772 8.276 
Segment 2 0.025 0.650 8.033 0.151 1.221 8.027 
Segment 3 0.043 0.494 6.462 0.091 0.607 6.059 
Segment 4 0.068 0.588 8.316 0.188 1.209 5.523 
Segment 5 0.031 1.281 10.344 0.091 0.997 12.715 
Average 0.084 1.076 8.542 0.148 1.161 8.120 


(Segments1-5) 
We use the average value of five different segments to represent the prediction results of 
SCB using the three methods, and the RMSE comparison is presented in Fig.12. 
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Fig.12 RMSE Comparison of PRN27 and PRN 40 for 6 hours, 3 days and 15 days 
The detail comparison results of the evaluation metrics of the VMD-LSTM, LSTM, and 


BP neural network prediction models are shown in Fig.13 and Fig.14. To facilitate 
comparison, we normalize each metric into a single dimension. Obviously, each metric of the 
SCB prediction model based on VMD-LSTM is optimal. 
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Fig.13 Metrics Comparison of PRN27 for 6 hours, 3 days and 15 days 
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Fig.14 Metrics Comparison of PRN40 for 6 hours, 3 days and 15 days 


This paper takes the average prediction results of PRN27 and PRN4O0 satellites for 6 
hours, 3 days, and 15 days as the predicted results of each model for the SCB of BDS 
hydrogen atomic clock. The analysis reveals that, under the VMD-LSTM model, the 
predicted SCB values for 6 hours, 3 days and 15 days are 0.048 ns, 0.204 ns and 1.397 ns, 
respectively. In contrast, the LSTM model predicts SCB values of 0.109 ns, 1.261 ns and 
8.289 ns for the same time intervals. Similarly, the BP neural network model forecasts SCB 
values of 0.116 ns, 1.119 ns and 8.331 ns, respectively. 

Comparing the results presented in Tables 4-6 and Figs.11-13, it is evident that the SCB 
predictions derived from the VMD-LSTM model exhibit superior performance compared to 
those from the LSTM and BP neural network models. Specifically, the VMD-LSTM model's 


predictions for 6 hours, 3 days and 15 days are approximately 56%, 84% and 83% higher than 
the LSTM model, and 59%, 82% and 83% higher than the BP neural network model. 

There are two reasons why the VMD-LSTM-based SCB prediction model established in 
this paper is effective: Firstly, transforming phase data into frequency data for prediction 
makes the prediction results closer to the true values. We verified this point by directly 
predicting phase data after removing outliers using VMD-LSTM, but the effect was not as 
good as predicting with frequency data and then converting them into phase data. The 
possible reason is that the phase-frequency conversion is equivalent to a low-pass filter, which 
has the effect of filtering out noise. Secondly, signal decomposition improves the prediction 
accuracy. By decomposing the signal into multiple mode components using VMD and taking 
them as inputs for the LSTM model, the local features and temporal-spatial relationships of 
the signal can be better captured, thereby improving the model’s prediction ability and 
generalization ability. 


6. Conclusions 


The precise estimation of the clock bias in atomic clocks holds significant importance for 
accurate timekeeping, navigation and positioning. This study introduces a novel approach for 
forecasting atomic clock bias by integrating signal decomposition and deep learning 
techniques, specifically utilizing VMD in conjunction with LSTM neural networks. Empirical 
findings demonstrate that for the BDS satellite hydrogen atomic clock, the projected clock 
errors over 6 hours, 3 days and 15 days are 0.048 ns, 0.204 ns and 1.397 ns, respectively, 
showcasing notable enhancements compared to both the LSTM and BP neural network 
models. 

Decomposing time series data can enhance prediction accuracy by enabling a more 
comprehensive understanding of the data components, facilitating the selection of suitable 
forecasting methods based on trends, seasonality, and residuals. Trend analysis aids in 
forecasting long-term patterns, seasonality analysis predicts periodic fluctuations, and residual 
analysis identifies anomalies or enhances forecasting models. VMD is a signal processing 
technique used to decompose complex time series signals into multiple local frequency 
patterns that can reflect different frequency components and amplitude variations in the signal. 
The main idea of VMD is to decompose the signal into multiple IMFs, which have different 
frequency and amplitude characteristics and are orthogonal to a certain extent, through 
optimization problems. Compared with traditional signal decomposition methods, VMD can 
better adapt to nonlinear and non-stationary signals, and has better local feature extraction 
capability. Consequently, employing an LSTM model to predict each data component leads to 
outcomes that closely align with actual values. By enhancing the precision and reliability of 
atomic clock error predictions, this study aims to establish a robust foundation for precise 
navigation, positioning, timekeeping, time synchronization, frequency regulation, and related 
applications. Future research will focus on optimizing LSTM model parameters and 
extending its application within the realm of time establishment and timekeeping. 
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